Design and Analysis of an Optimal Instruction-Retry Policy for TMR Controller Computers

نویسندگان

  • Hagbae Kim
  • Kang G. Shin
چکیده

An instruction-retry policy is proposed to enhance the fault-tolerance of triple modular redundant (TMR) controller computers by adding time redundancy to them. A TMR failure is said to occur if a TMR system fails to establish a majority among its modules' outputs due to multiple faulty modules or a faulty voter. Either multiple consecutive TMR failures the active period of which exceeds a certain time limit or the exhaustion of spares as a result of frequent system reconngurations may result in failure to meet the timing constraints of one or more tasks, called the dynamic failure, during a given mission. An optimal instruction-retry period is derived by minimizing the probability of dynamic failure upon detection of either a masked (by the TMR) error or a TMR failure. We also derive the minimum number of spares needed to keep below the pre-speciied level the probability of dynamic failure for a given mission by using the derived optimal retry period. Any opinions, ndings, and conclusions or recommendations expressed in this paper are those of the authors and do not necessarily reeect the view of the funding agencies.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Optimal Design of UPFC Output Feed Back Controller for Power System Stability Enhancement by Hybrid PSO and GSA

In this paper, the optimal design of supplementary controller parameters of a unified powerflow controller(UPFC) for damping low-frequency oscillations in a weakly connected systemis investigated. The individual design of the UPFC controller, using hybrid particle swarmoptimization and gravitational search algorithm (PSOGSA)technique under 3 loadingoperating conditions, is discussed. The effect...

متن کامل

Evaluation of Fault Tolerance Latency from Real-Time Application's Perspectives

The Fault-Tolerance Latency (FTL) deened as the time required by all sequential steps taken to recover from an error is important to the design and evaluation of fault-tolerant computers used in safety-critical real-time control systems. To meet timing constraints or avoid dynamic failure, the latency of any fault-handling policy | that consists of several stages like error detection, fault loc...

متن کامل

Coordinated Design of PSS and SSSC Damping Controller Considering Time Delays using Biogeography-based Optimization Algorithm

In this paper, a consistent pattern with the optimal coordinated design of PSS and SSSC controller to improve the damping of low frequency oscillations is shown. In this design, sensing and signal transmission time delays are considered as effectiveness parameters. The design problem has been considered an optimization problem and biogeography-based optimization (BBO) algorithm is used for sear...

متن کامل

Branch Recovery with Compiler-Assisted Multiple Instruction Retry

In processing systems where rapid recovery from transient faults is important, schemes for multiple instruction rollback recovery may be appropriate. Multiple instruction retry has been implemented in hardware by researchers and also in mainframe computers. This paper extends compiler-assisted instruction retry to a broad class of code execution failures [l]. Five benchmarks were used to measur...

متن کامل

AN OPTIMAL FUZZY SLIDING MODE CONTROLLER DESIGN BASED ON PARTICLE SWARM OPTIMIZATION AND USING SCALAR SIGN FUNCTION

This paper addresses the problems caused by an inappropriate selection of sliding surface parameters in fuzzy sliding mode controllers via an optimization approach. In particular, the proposed method employs the parallel distributed compensator scheme to design the state feedback based control law. The controller gains are determined in offline mode via a linear quadratic regular. The particle ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • IEEE Trans. Computers

دوره 45  شماره 

صفحات  -

تاریخ انتشار 1996